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A Comparative Study of Polar Code Constructions 

for the AWGN Channel 

Harish Vangala, Emanuele Viterbo and Yi Hong 


Abstract — We present a comparative study of the performance 
of various polar code constructions in an additive white Gaussian 
noise (AWGN) channel. A polar code construction is any algo¬ 
rithm that selects K best among N possible polar bit-channels 
at the design signal-to-noise-ratio (design-SNR) in terms of bit 
error rate (BER). Optimal polar code construction is hard and 
therefore many suboptimal polar code constructions have been 
proposed at different computational complexities. Polar codes are 
also non-universal meaning the code changes significantly with 
the design-SNR. However, it is not known which construction 
algorithm at what design-SNR constructs the best polar codes. We 
first present a comprehensive survey of all the well-known polar 
code constructions along with their full implementations. We then 
propose a heuristic algorithm to find the best design-SNR for 
constructing best possible polar codes from a given construction 
algorithm. The proposed algorithm involves a search among 
several possible deslgn-SNRs. We finally use our algorithm to 
perform a comparison of different construction algorithms using 
extensive simulations. We find that all polar code construction 
algorithms generate equally good polar codes in an AWGN 
channel, if the design-SNR is optimized. 

Keywords —Bhattacharyya bounds, bit-channels, Gaussian ap¬ 
proximation, polar codes 

1. Introduction 

Polar codes have been the subject of active research in recent 
times, mainly due to the fact that they are the first ever 
provably capacity achieving codes, with explicit construction 
and very low complexity of encoding and decoding. The polar 
codes were invented by Erdal Arikan [1], using a novel concept 
called channel polarization. Soon after, both the concept of 
channel polarization as well as polar codes have been extended 
to a number of applications and generalizations [13]-[27]. 

Let us consider a binary input discrete memoryless sym¬ 
metric (BI-DMS) channel. Channel polarization is a technique 
by which one manufactures N polarized channels (called bit- 
channels) out of N identical independent copies of BI-DMS 
channels. The channels are polarized without any loss of 
capacity, in the sense that they are either extremely noisy 
or noiseless as V —> oo. Then one can easily achieve a 
rate of transmission close to capacity, simply by choosing 
to transmit over only the good bit-channels. However, at any 
finite blocklength N and rate R = K/N, a ranking algorithm 
for the bit-channels according to their bit error rate (BER) 
becomes necessary to select K good channels out of N. Here, 
K is the number of information bits in each code word of 
length N. This selection of bit-channels completely defines a 
polar code and therefore is called the polar code construction. 

The polar code construction is critical to obtain the best 
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performance at finite blocklengths. As we mentioned, the polar 
code construction has an explicit definition in theory. It is 
challenging in practice because precise estimation of the bit- 
channels is intractable. Therefore, a wide range of approximate 
construction methods are proposed in [1]-[12]. 

An important characteristic of polar codes is their non¬ 
universality. That is, different polar codes are generated de¬ 
pending on the specified value of signal-to-noise ratio (SNR), 
known as the design-SNR. A change in operating SNR is 
possible in practice but a change in code according to SNR 
is not desirable. Therefore we wish to construct a polar code 
at one design-SNR and use it for a range of possible SNRs. 
As we see later, the choice of design-SNR is critical for the 
performance at all SNRs of interest. Unfortunately, there has 
been no study to identify the best design-SNR for any polar 
code construction. 

A major issue with polar codes has been their inferior BER 
performance at finite blocklengths, compared to the state-of- 
the-art LDPC and Turbo codes of similar blocklength [28]. 
Several ideas have been proposed to overcome this issue. One 
may use a list decoder [28] to overcome this problem but this 
comes at the expense of an increase in decoding complexity. 

In this paper, we aim at finding the best polar code con¬ 
struction algorithm among a range of available algorithms 
over a binary input additive white Gaussian noise (BI-AWGN) 
channel. We first propose a simple search algorithm to find 
the best design-SNR for each construction algorithm. Then 
we can find the best code construction algorithm among all. 
We find in our extensive simulations that all construction 
algorithms produce equally good polar codes when design- 
SNR is optimized. 

The rest of the paper is organized as follows. In Section II, 
we present brief introduction to polar codes and outline our 
notation and channel model. In Section II, we describe the 
encoding, successive cancellation decoding and the construc¬ 
tion of polar codes. In Section III, we review four main polar 
code construction algorithms which form the basis for several 
other variations. In Section IV, we discuss the non-universality 
of polar codes. Our discussion includes a proposal to fairly 
compare all construction algorithms. In Section V, we give a 
detailed description along with pseudocode for efficient imple¬ 
mentations of polar code constructions. We finally present our 
simulation results in Section VI and conclude in Section VII. 

II. Polar Codes 

Given any subset of indices X of elements of a vector x, 
we denote the corresponding sub-vector as xx. Similarly, 
when X denotes the indices of columns of a matrix A, the 
corresponding sub-matrix is denoted Ax. 

A polar code may be specified completely by {N,K,F), 
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Fig. 1. Illustration of Arikan’s 0(Allog2 N) complexity encoder implemen¬ 
tation of (2) with (N, K, T) = (8, 5, {0, 2,4}) 

where N is the length of a code word in bits, K is the number 
of information bits encoded per codeword, and is a set 
of N — K integer indices called frozen bit locations from 

Encoding — For an {N, K, F) polar code we describe below 
the encoding operation for a vector of information bits u of 
length K. The rate of the code is i? = K/N. Let n = log 2 (iV) 
and F®" = F 0 • • • 0 F (n copies) be the n-fold Kronecker 
product of Arikan’s standard polarizing kernel F = i]- 

Then, a codeword is generated as 

X = G • u = (F®")^, • u, (1) 

where = {0,1,..., — 1}\-F corresponds to the set of 

non-frozen bit indices and G = (F®")^^^ is the generator 
matrix of polar code. A compact alternative form is 

X = F®"d (2) 

where d € {0,1}^ is such that d_F = 0 and d_Fc = u. 

Note that djr is the set of frozen bits as defined in 
Arikan’s original formulation [1] and is taken here as zeros. 
Arikan also proposed an efficient implementation of complex¬ 
ity ©(TV log A^) of the encoding (2) as shown in Fig. 1. 
Modulation and Channel Model — Our channel is a BI- 
AWGN channel, with zero mean and variance Bits in x 
are modulated as x using binary phase shift keying (BPSK). 
It maps 0 —>■ —y/REi, and 1 —> +y/REh, where E^ denotes 
the energy spent per each information bit. We thus obtain the 
following channel 

y = X + n. (3) 


Without loss of generality, we normalize the noise variace 
to be unity for the AWGN in all our future discussions. 
Successive Cancellation Decoder (SCD) — The SCD algo¬ 
rithm [1] essentially follows the same encoder diagram in 
Fig.l using decoding operations that resemble one iteration 
of the classic belief propagation algorithm. The likelihoods 
evolve in the reverse direction from right-to-left, using a pair 
of likelihood transformation equations, as illustrated with an 
example in [1]. Then the bit decisions are made at the left 
end of the circuit and broadcasted to the rest of the circuit. A 


complete pseudocode for implementing an SCD is available 
in [29]. The overall complexity is only ©(A^log 2 W). 

The Polar Code Construction — The choice of the set is a 
critical step in polar coding (i.e. the polar code construction). 
This corresponds to the selection of best K bit-channels 
among N, in terms of the bit error rate (BER) at a given 
value of (REb/No) defined as the design-SNR. 

Since the exact BER of bit-channels is intractable, several 
approximations are used. This leads to many different con¬ 
structions reviewed in Section III. The detailed algorithms are 
given later in Section V. 

III. Overview of Current Literature on 
Polar Code Constructions 

The earliest construction of polar codes is based on the 
evolution of simple bounds on the Bhattacharyya parameters 
of bit channels [2]. Though these bounds are proved in [1] only 
for BI-DMS channels, they may also be extended for infinite 
alphabet channels such as BI-AWGN. Although the bounds 
are loose for many of the bit-channels and more accurate 
methods are devised later, the codes designed using such 
bounds exhibit good performance. This construction enjoys 
the least 0{N) complexity among all (excluding the selection 
of K best among N metrics obtained). This includes 2N — 1 
transformation operations corresponding to the transformation 
of upper bounds by polarizing kernel. This well-known method 
was also studied in a recent paper [7]. 

Another earlier construction is proposed in [1], based on 
a Monte-Carlo simulation of the bit-channels. This can be 
applied to a wide range of channels including finite and infinite 
alphabet channels such as AWGN. However, the algorithm has 
the greatest complexity O {MN log N) among all, where M 
is the number of iterations of the Monte-Carlo simulation. 

A more recent construction algorithm was proposed by Tal 
and Vardy [3] based an earlier proposal from Mori and Tanaka 
[5], [6]. At first it was proposed to find the bit channels 
by evaluating of their full finite alphabet distributions. The 
algorithm becomes intractable due to the explosion of the 
alphabet size to a power of N by the end of n channel 
transformations. This problem is specifically addressed in 
[3] by employing a novel low complexity close-to-optimal 
quantizer. In addition, they provide theoretical guarantees for 
the loss of performance due to the quantization. Note that, 
some channels are better estimated by simple bounds on 
Bhattacharyya parameters [1], [2]. Hence, in [3] the authors 
conclude a final algorithm by improving the BER estimates 
with the Bhattacharyya parameters whenever they are better. 
The final algorithm is considered by far the most accurate 
construction algorithm available with theoretical guarantees. 

The algorithm is extendable to infinite output channels by 
using a quantization algorithm. In [3], the authors propose 
a quantization algorithm for AWGN channels. When the 
channel output has p, symbols, the final complexity of the 
algorithm is 0{N ■ log /x) (excluding the selection of K best 
among N metrics obtained). This contains 2N — 1 number 
of 0{p^) complexity bit channel convolutions plus 2N — 1 
number of 0{p^ log p) complexity quantizer operations. The 
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quantizer uses an intelligent data-structure that combines a 
heap and a list. The initialization of the algorithm involves 
the AWGN channel quantization to fi symbols, which takes 
an additional 0(/i) complexity. Overall, this algorithm has 
the second largest complexity, only next to the earlier Monte- 
Carlo based construction algorithm by Arikan. 

For AWGN channels, the estimation of bit-channels based 
on Gaussian approximation is proposed in [4]. This enables 
to use the Gaussian distribution approximations on the inter¬ 
mediate likelihoods. This was found to well-approximate the 
actual bit-channels of polar codes [8], [9]. A similar algorithm 
after the original proposal in [4] was studied in [8], [9]. 

The Gaussian approximation algorithm takes a complexity 
of 0{N) function computations (excluding the selection of K 
best among N metrics obtained) similar to the Bhattacharyya 
bounds based algorithm, but involves relatively higher com¬ 
plexity function computations. Overall, this construction algo¬ 
rithm enjoys the second least complexity. 

There also exist several other heuristic constructions, ex¬ 
tended or inspired from the above constructions, e.g. [10]- 
[12]. However, these methods are less interesting due to poorer 
performance, higher complexity, and having no theoretical 
guarantees. Finally, constructions for different channels are 
also available in [21], [30]-[32], for different input alphabets 
[33], different kernels [34]-[36] and concatenated codes [37], 
[38], which fall out of the scope of this paper. 

IV. Non-Universality OF Polar Codes 

In coding theory, most of the codes are universal in the sense 
that their definition is independent of the channel SNR, but 
polar codes are different. Arikan defines the set if of polar 
codes such that the block error rate (BLER) of polar codes 
is minimum under SCO. Since BLER is a function of SNR, 
it is not very surprising that the polar code changes with the 
given design-SNR. Later in our simulations, we see that the 
change is very significant in terms of performance. There 
are a few recent attempts to design universal polar codes 
[27], [32], [39] but they come at a cost of much higher 
complexity at decoder and/or encoder. Eurther development 
of the theory of universal polar codes is required for their 
practical significance. In this paper, we restrict our attention 
towards Arikan’s original polar codes only. We aim to design 
a polar code at a particular design-SNR and use it for a range 
of SNRs due to the following reasons. 

1) In a number of experiments it is evident that the perfor¬ 
mance of polar codes constructed at one design-SNR is 
good for a range of SNRs (see for e.g. [28]). 

2) The construction algorithms are not optimizing the per¬ 
formance exactly at the design-SNR. That is, better 
performance at a given SNR may be obtained by con¬ 
structing the code at a slightly different design-SNR (see 
Section VI). This means even if we update the code 
dynamically with SNR, the performance may not be 
optimal. 

As a result, the problem reduces to finding at what design- 
SNR, we should design the polar code. Unfortunately, to the 
best of our knowledge, there is no such study of polar codes in 


this direction. Many research works often consider a heuristic 
choice of design-SNR. 

In Section VI we see that the choice of design-SNR is 
indeed critical for its performance at all SNRs. Eurther, we find 
that such a performance depends on many parameters such as 
rate, blocklength and the algorithm used for the construction. 
This observation makes an exhaustive search for the design- 
SNR inevitable. The following simple search algorithm is 
proposed to find the best design-SNR. 

1) Consider a set of SNRs {S'!, S 2 , ■ • ■, -Sm} that covers 
the range of SNRs of interest. 

2) Design m polar codes at design-SNRs equal to Si, i = 

(i.e. find an if at each Si), using any given 
construction method. 

3) Plot the performance curves BER vs. SNR or BLER vs. 
SNR of all above polar codes. 

4) Select the curve that best suits the needs of the target 
application and declare the corresponding SNR for its 
construction. 

We may use the search for different construction algorithms 
and make a fair comparison of performance among all. Ex¬ 
tensive simulations of this comparison strategy are presented 
in Section VI. 

V. The Polar Code Construction Algorithms 

In this section we review all four important polar code 
construction methods denoted PCC-0, PCC-1, PCC-2, PCC- 
3, and provide full pseudocode implementations. The corre¬ 
sponding bit-channel metrics generated from each algorithm 
are denoted z^'^\ z^^'i respectively. 

We should use logarithmic domain (or log-domain in short) 
calculations especially at high blocklengths such as N > 256 
to avoid underflow. Eor simplicity we mention linear domain 
formulas only. 

A. PCC-0.' Arikan’s Bhattacharyya bounds of bit channels 
This earliest construction is from Arikan [1], [2], using Bhat¬ 
tacharyya parameters. He proved in [1, Appendix-D] that a 
pair of upper bounds on the Bhattacharyya parameters of bit- 
channels evolve as simply as {z, z} —> {2z — z^, z^j at each 
polarizing transform F. Due to its simplicity, this construction 
has been widely used, and produced good polar codes. 

There is an important modification to be used over [2], 
which is due to the non-universality of polar codes as ex¬ 
plained in [8]. The original recursive algorithm requires an 
initial value and this was proposed by Arikan as 0.5 [2], cor¬ 
responding to the worst BER. This initial is actually the Bhat¬ 
tacharyya parameter of the underlying BI-AWGN channel, 
therefore it may be replaced with exp{—REi,/No) [8]. Now, 
the original initial 0.5 will be obtained at = —1.5917dB. 
The final algorithm is given below as Algorithm PCC-0. 

A sample run of PCC-0 reveals that some of the bounds 
quickly increase and may even increase beyond 0.5 (but always 
< 1). This suggests a very inaccurate channel estimation, 
which motivated several alternate constructions. However, the 
advantage of the alternate constructions was not characterized 
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well in literature. 

Algorithm PCC-0 ; The Bhattacharyya bounds 

INPUT : N, K, and design-SNR EdB = {REi,/No in dB) 
OUTPUT: C {0,1,.. ., Af - 1} with \E\ = N - K 

1: 5 = ^ JY 

2 : S , initialize = exp (—5) 

3: for j = I : n do > for each stage in Fig. 1, right-to-left 

4 : u = 2^ 

5: for f = 0:^ — Ido > For each connection 

6: r=z(°)[t] 

7: z(®) [t] = 2T — C> Upper channel 

8: z(®) [tj/2 + t] = > Lower channel 

9 : end 

10 : end 

11 : E = indices of greatest elements^z^®) , N — 

I I Find indices of the greatest N — K elements 

12 : Return E 


Function indices_of_greatest_elements(v, 1) 

INPUT : Vector v of dimension |v| X 1 and integer I 
OUTPUT: X, an / X 1 integer vector containing I indices in 

1 : [v, idx] = Sort(v, ’descending’) 

//obtain in idx, the |v| indices of vector v when sorted in 
descending order 

2 : X = idx[0 ; Z — 1] > Store the first I indices 

3: Return X 

// Note: This is a simple implementation which is okay for 

// |v| up to a few thousands. For optimal performance, one should 

// use more advanced selection algorithms [40], [41]. 


B. PCC-1.’ Arikan’s Monte-Carlo estimation of bit channels 
A Monte-Carlo estimation of the bit-channel metrics is pro¬ 
posed in [1]. As a simulation based algorithm, it can be applied 
to a variety of channels. 

We consider two specific improvements to the original 
proposal in [1] as follows. 

1) We simulate all-zero codeword transmission only, since 
the polar code is a linear code. Each iteration is now 
equivalent to an SCO with all bits treated as frozen. 

2) We calculate BER of bit-channels rather than their Bhat¬ 
tacharyya parameters [1, Eq. (54) and (80)]. 

The first modification simplifies the algorithm by avoiding 
the encoding operation for each simulated transmission, in 
addition to avoiding the step of updating the bit decisions in 
SCD. The second modification improves the precision of the 
estimate, simply because we use the exact BERs. This reflects 
in the overall BEER equation [9, Eq. (3)] which is relatively 
more accurate than [1, Eq. (54) and (65)]. The complexity 
reduces to half due to these modifications. Also, as mentioned 
earlier the likelihood operations (step-7, step-12 of UpdateL 
and step-7,step-l 1 of PCC-1) are given in linear domain, but 
it is preferable to perform these operations in log-domain. 

One particular disadvantage of this construction is the 
accuracy of the construction, which is limited severely by 


the Monte-Carlo iterations M. More specifically, the channels 
with BER less than or close to 1/M receive highly unreliable 
estimates. There exist many bit-channels that have extremely 
low BER, due to the polarization effect. Most of such good 
channels will receive a zero estimate, which makes a compar¬ 
ison difficult. 

When the rate is high enough, the construction works well, 
since the good channels are always chosen for the information 
transmission. This avoids the need of any comparison among 
the good channels. On the other hand, when the rate is 
very low, the choice will only be among channels that tend 
to be very good. In that case, any choice would result in 
approximately the same performance. 


Algorithm PCC-1 ; The Monte-Carlo estimation 


INPUT : N, K, the design-SNR EdB = RE^/No in dB, 

M = Monte-Carlo size, randriQ — a standard 
Gaussian pseudo random number generator 
OUTPUT: X’ C {0,1,..., Af - 1} with |X’| = A^ - if 

Allocate and make visible to the other functions TV X (n -F 1) 
matrices B, L and set B = 0 
Bit-channel metrics c £ initialize c = 0 
n = log 2 TV and S = 
for i = 1 : TlX do 

y = —s/S + randn(N, 1) > normalized channel, all-zero tx 

L = NaN > Reset L 

Initialize the last column of L: 

Ly][n] =Pr(yj|0)/Pr(yy|l) = exp (-2t;jVS) Vj 
for 7 = 0,1,. .. , TV — 1 do > The SCD with all frozen 

I = bitreversal(T) 

UpdateL(Z, 0) > Update L, esp. L[T][0] 

0, if L[Z][0] > 1 
1, else 

end 

c = c + d > Real addition 

end 

z^^) = c/M > Monte-Carlo averaging for BERs (not necessary) 
E = indices_of_highest_elements , TV — k'^ 

// Find indices of the highest N — K elements 

17: return E 


d[Z] = 


C. PCC-2.’ Tal & Vardy’s estimation of bit-channel TPMs 

Tal & Vardy’s construction algorithm [3], attempts to estimate 
the full transition probability matrices (TPMs) of bit-channels, 
instead of only estimating BERs. The desired BERs may 
be estimated from the TPMs. In spite of this effort, some 
estimates may be looser than the simple Bhattacharyya bounds 
calculated in PCC-0. This motivates to combine these two 
methods and come up with a hybrid algorithm that gives 
better estimates of the bit-channels compared to the individual 
algorithms. The final algorithm is denoted as Tal & Vardy’s 
construction for polar codes [3]. Like before, we compute 
BERs of bit-channels instead of Bhattacharyya parameters for 
improved accuracy. 

When the channel is symmetric we can consider only the 
half of the channel. So when we say that the output alphabet 
is of size /i and its TPM is of dimension 2 x /i, we are actually 
referring to a symmetric channel of 2/i output symbols and its 
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Function UpdateL(i, j) : Recursive LR comp, of SCD 


8 : 

9 : 

10 : 

11: 

12 : 

13 : 

14 : 


INPUT : Element indices i,j 
OUTPUT: Recursively updated matrix L 


u = and I = (i mod u) 

i{ I < u/2 then 

if (L[i][j + 1] = NaN) then 
^ UpdateL(i, j + 1); end; 


if (L[i + u/2][j + 1] = NaN) then 
^ UpdateL(i + u/2, j + 1); end; 


+ l]L[i + u/2][j + 1] + 1 
+ 1] + L[i + u/2\[j + 1] 


if B[i — u/2][j] = 0 then 

L[*]bi = L[*]b + l]L[i - u/2][j + 1] 

else 

LWb +1] 


Lfiim = 


L[i-u/2][/ + l] 


end 


end 


> Upper branch 


> Lower branch 


TPM of dimension 2 x 2/i. 

The output size of bit-channels grows rapidly, therefore a 
quantizer algorithm is proposed to control its size. It will then 
be used whenever the alphabet size exceeds a threshold p. 
The quantization is performed such that the channel’s BER 
is preserved and the capacity of the quantized channel is 
maximized. In later part, we discuss the following components 
of the overall construction. 


1) Quantization of the AWGN channel to initialize the code 
construction. 

2) The bit-channel convolutions corresponding to each use 
of Arikan’s kernel F. 

3) The main quantizer algorithm for controlling the size of 
the alphabet after each bit-channel convolution. 

4) The main construction algorithm utilizing several com¬ 
ponent algorithms discussed before. 

Quantization of the AWGN channel — Given an AWGN 
channel with BPSK modulation as defined in (3) (with nor¬ 
malization), we may quantize its infinite alphabet to size /r 
before we use Tal (& Vardy’s construction algorithm. Such a 
quantization algorithm is given in [3], and reviewed below. 

The quantization involves first finding p. consecutive semi¬ 
open intervals of the positive real line IR+. As explained below, 
the intervals are the pre-images of p partitions of uniform 
length of [0,1] (the range of instantaneous capacity function). 
The instantaneous capacity function C'[A(t/)] is given by: 

C'[A(t/)] = 1 - log2 (1 + A) -f log2 A; (4) 

where 


Hy) = 


/y(j/|0) 


(5) 


In an AWGN channel according to (3) under normalization, 
the conditional densities are: /f('|0) ~ Af 


/y(-|l) ~ M l) ■ Therefore, 



( 6 ) 


We then obtain p -f 1 consecutive points (oq = 0) < oi < 

• • • < Op-i < (a^ = oo) by solving the following equation. 
These points {oi} will partition the entire positive real line 
into p intervals that correspond to p quantized symbols, such 
that 

C[A(aQ] = ^, V/ = l,...,p-l (7) 

Finally the entries of the 2 x p dimensional TPM P = [pif of 
the quantized channel (with an input alphabet {0,1} and an 
output alphabet {0,1,..., p — 1}) are obtained by integrating 
the corresponding channel distribution over the respective 
intervals as below. 

Pij = Pr(j is received | i is transmitted) 


frivli) dy 


( 8 ) 


and 


Q{x) = / exp{-x'^/2)dx 

J X 


A full pseudocode implementation of the overall quantization 
of AWGN channel to /x symbols is given below. 


Function discretizeAWGN() 

INPUT : Quantization size /r, design-SNR EdB = RE^/Nq in 
dB 

OUTPUT: TPM P of size 2 x 


1 : 

2 : 

3 : 

4 : 

5 : 

6 : 

7 : 


9 : 

10 : 

11 : 

12 : 


s = 

\(y) = exp (^—2y\/2S^ and > Eq. (6) 

C{x) = 1- log 2 (l -l-a;) + xlog 2 (a;)/(l + x) > Eq. (4) 

a £ R^, initialize a[0] =0 and a[p] = oo 
for jr = 1 : /i — 1 do 

a[j] = solve{C(A(j/)) = j/fi} 

end 

for / = 0 : /i — 1 do > Eq. (8) 

P[0][/] = Q + a[j]) - Q + a[j + 1]) 

P[i]b1 = <3(-v^ + abl) -Q(-v^ + 4i + i]) 

end 

Return P 


The bit-channel convolutions — Each recursive use of the basic 
kernel F = [ g } ] generates two polarized channels from a 
pair of identical channels W : W —■ y. All the channels are 
represented by TPMs of two rows. 

The pair of polarized channels are obtained by two self con¬ 
volution operators ffl and H, referred to as upper-convolution 
and lower-convolution, respectively (0 denotes usual binary 
EX-OR). The corresponding definitions are below [1], [3]. 


WfflW(yi,y2 |i) = ^{w(t/i|0)W(2/2|0©t) 0 

W(2/i|l)W(2/2|l©i)},Vt/i,2/2 GJ' (9) 


My) = exp 
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WMW{yi,y2,b |i) = ^W{yi\i)W{y2\b ® i) 

V 2 /i,y 2 e and 6 g {0,1} (10) 


Proper care must be taken in applying these convolutions, to 
have all columns with likelihoods > 1 (symmetric half of the 
channel). Whenever this gets violated, we simply swap the two 
probability values in the column, which brings its symmetric 
symbol in place. Another issue is whenever there are two or 
more columns with the same LR, we merge them to one by 
simply adding the columns. 


Finally, we can see that the size of output alphabet of both 
the new channels is This increase in alphabet size is 

indeed high, which soon becomes intractable as we apply this 
transformation recursively. Hence the quantizer algorithms are 
essential. 


Function upperConvolve() 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 


INPUT : A 2 X /2 TPM P of channel to get convolved 
OUTPUT: TPM Q of size 2 x /r' < + l)/2 

Allocate Q G and initialize idx = — 1 

for 2 = 0 : /i — 1 do 
idx = idx + 1 

Q[0]Mx] = (P[0][*]2 + P[l][i]2)/2 
Q[l][idx] =P[0][i]P[l][i] 
for j = 2 + l:/ 2 — Ido 
idx = idx + 1 

Q[0]Mx] = P[0][i]P[0][i] + P[l][i]P[l][i] 
Q[l][idx] = P[0][i]P[l][i] + P[l][i]P[0][i] 
if Q[0][i(ia::] < Q[l][idx] then 
^ swap(Q[0][i(ia::], Q[l][idx]); end; 

end 

end 

Merge the columns of Q with same LR (add columns) 
Return Q 


The quantizer algorithm — A critical part of the construction 
algorithm is to find a good quantization algorithm, which was 
given in [3], as discussed below. This quantization algorithm 
requires a special data-structure which is a combination of a 
heap and a linked list. We call this data-structure a heaplist. 

A heaplist is a data-structure that holds L symbol proba¬ 
bilities (columns) from the TPM as a list and L — 1 loss-of- 
capacity values as a heap. These values in heap result from 
several symbol-merger operations, defined below. 

One main objective of the data-structure is to enable a low 
complexity 0{\ogL) operation for extracting the minimum of 
the values in heap, while simultaneously maintaining the list 
of L symbols in a sorted order of likelihoods. 

A symbol-merger operation is the critical component of 
this quantization algorithm, and is a part of the heaplist. 
This operation is allowed only on two consecutive symbols 
in the list. A merger of two columns in the list replaces 
the two columns by a single column equal to their sum 
corresponding to one new symbol, simultaneously adjusting 
the heap accordingly. Each merger reduces the size of the 
heaplist by one element and changes the heaplist. 

Overall, a heaplist is simply a data-structure providing three 


Function lowerConvolve() 

INPUT : A 2 X /I TPM P of channel to get convolved 

OUTPUT: TPM Q of size 2 X p, p < n-iti + 1) 

1 : Allocate Q g r2xh(m+i) and initialize idx = — 1 

2 : for i 

= 0 : /i — 1 do 

3: 

idx = idx + 1 

4: 

Q[0][i(ia;] = P[0][j]2/2 

5: 

Q[l][idx] = P[l][f]V2 

6: 

idx = idx + 1 

7: 

Q[0][idx] = P[0][j]P[l][i] 

8: 

Q[l][icix] = Q[0][jdx] 

9: 

for j = 2 -|-l:/i— Ido 

10: 


idx = idx + 1 

11: 


Q[0][fdx] = P[0][j]P[0][i] 

12: 


= P[l] [j]P[l] [j] 

13: 


if Q[0][i(ia:] < Q[l][idx] then 

14: 


^ swap(Q[0][i(ix], Q[l] [idx]); end; 

15: 


idx = idx + 1 

16: 


Q[0]Mx] = P[0][t]P[l][i] 

17: 


Q[l][fdx] = P[l][*]P[0][i] 

18: 


if Q[0][ida:] < Q[l][idx] then 

19: 


^ swap(Q[0][ida;], Q[l][icix]); end; 

20: 

end 

21 : end 


22 : Merge the columns of Q with same LR (add columns) 

23: Return Q 


main operations described as follows. 

• initialize_heaplist{)\ Given a 2 x L TPM, initialize 
the list part of the data-structure with its columns in the 
increasing order of likelihoods, all > 1. Involves a sorting 
operation with 0{L\ogL) complexity. The heap part of 
the data-structure is stored with the L — \ values of loss 
in capacity when two consecutive symbols in the list are 
merged. These L — 1 values in heap are tightly attached 
to the first L — 1 values in the list. 

• minloss_index{): Find the value of minimum loss-of- 
capacity, and return the index of the column in the list 
attached to the minimum loss. The index indicates the 
optimal symbol-merger, which merges the column and 
its next. 

• merge_at_index{): Perform the merger operation of the 
two elements in list, present at the given index and its 
next. This will result in the reduction of size of the heap 
and the list, by one element. One entry in the list gets 
updated and one entry gets removed. Accordingly, a few 
entries in the heap will be updated. 

• Miscellaneous: 

size{) - to know the number of columns of the TPM 
currently maintained within the heaplist. 

TPM{) - to extract the TPM from heap, in a two-row 
matrix format. 

has_duplicates{) - true or false based on whether there 
are columns in list with same likelihood ratio. 

The Overall Implementation and Pseudocode — The following 
few steps are involved in the main algorithm to generate N 
symmetric bit-channel TPMs. 
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Function Quantize_to_size() 

INPUT : TPM P of size 2 X L, quantization size fj, < L 
OUTPUT: Quantized TPM of size 2 X fi' 

1 : Instantiate a heaplist object H 
2 : H.initialize_heaplist{P) 

3: while H.sizeQ > fj, ov H.has_duplicates{) do 
4: idx = H .minloss _index() 

5: H.merge_at_index{idx) 

6: end 

7: Return H.TPMQ 


1) Initialize the list with one finite alphabet channel equal 
to the quantized version of the AWGN channel having 
alphabet size /x. 

2) For each finite alphabet channel in the list, repeat the 
following: 

a) Perform upper and lower convolutions of the channel 
with itself to generate two new channels of higher size. 

b) Quantize the two resultant channels to the size p and 
replace the one original channel. 

3) If the number of channels is less than N, then repeat step 
2, otherwise go to step 4. 

4) Calculate the BER of each channel and compare with 
its Bhattacharyya parameter calculated using PCC-0. 
Declare the least value as the bit-channel metric. 

The pseudocode of the same is provided as algorithm PCC-2. 
This represents the full construction algorithm of Tal & Vardy 
[3], except that we compute exact-BER of the bit-channels 
instead of their Bhattacharyya bounds for the comparison. This 
improves the precision of bit-channel metrics before they are 
compared and the greatest N — K values are selected. 

D. PCC-3.' Trifonov’s Gaussian approximation of bit channels 
Gaussian approximation for polar code construction was first 
proposed by Trifonov [4] and rediscovered in [8], [9]. 

The basic idea is to estimate all the log-likelihood ratios 
(LLR’s) at intermediate stages as Gaussian variables. This 
simplifies the analysis by requiring only to calculate the mean 
and variance of the LLRs at each stage of decoding. The bit 
channel metrics in this case are different. We estimate a bit- 
channel metric proportional to the argument of a Q-function, 
which represents its BER under Gaussian approximation [9]. 
So while being sufficient for comparison, the metrics in 
are not BERs but are inversely proportional to the BERs of 
bit-channels unlike the metrics from other constructions. 

The following interpolation function and its inverse are 
essentially used to perform this construction (as in [42]). 

f exp(-0.4527a;° ®® -f 0.0218) if 0 < a; < 10 

m = \ 

i “ ^)exp(-a;/4), if x > 10 

( 11 ) 

Eull pseudocode to implement this algorithm is given below. 
We may employ bisection method to find the inverse of the 
function (j){x) (we observed that for this special function, 
bisection method works faster than Newton-Raphson method). 


Algorithm PCC-2 : The full TPM estimation 


1 : 


2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 


22 : 


INPUT : N, K and design-SNR EdB = {REb/No in dB); 

is vector of bit-channel metrics from PCC-0 
OUTPUT: C {0,1,..., TV - 1} with \E\ = N - K 

Allocate channels, an array of TV TPMs, each of dimension 
2 X {I. chi, ch2 are to hold intermediate TPMs. 

n = log 2 TV 

channels[0] = discretizeAWGN{fi, EdB) //Initialize 

for j = 1 : n do > for each stage in Fig. 1, right-to-left 

u = 2T 

for T = 0 : ^ — 1 do 

chi = upperConvolve(c/ianneZ5[f]) 
ch2 = \o'WGrConvoWG(channels[t]) 
channels[t] = Quantize_to_size(c/il,/i) 
channels[u/2 + f] = Quantize_to_size(c/i 25 /i) 

end 

end 

Allocate a real vector dimension TV x 1 
for T = 0 : TV — 1 do 
[7] = 0 

= no. of columns in channels[i] 
for j = 0 : fi' — 1 do 
I = z^^^[T] + {channels[i])[l][j] 

end 

z^^) [T] = min [T], z^^^ [T]} 

end 

E = indices_of_greatest_elements 
I I Find indices of the greatest N — K elements 

Return E 


Algorithm PCC-3 : The Gaussian approximation 

INPUT : N, K and design-SNR EdB = (REh/No in dB) 
OUTPUT: C {0,1,..., Af - 1} with \E\ = N - K 

1: S = \QX:dB/\0 ^ jy 

2 : g initialize z(®)[0] = 4S 

3: for j = 1 : n do > for each stage in Fig. 1, right-to-left 

4 : U = 2i 

5 : for 4 = 0 : ^ — 1 do 

6: T = z(3) [t] 

7: (l — (1 — > Upper channel 

8: z^^fu/2 +1] = 2T > Lower channel 

9: end 

10 : end 

11 : E = indices_of_least_elements^z(®), N — 

// Find indices of the least N — K elements 

12 : Return E 

Function indices_of_least_elements(v, Z) 

INPUT : Vector v of dimension |v| X 1 and integer I 
OUTPUT: An I x 1 integer vector containing I indices in 

1 : Return Andices_of_greatest_elements(—v, 1)) 


VI. Simulations and Discussion 

In this section we consider comparing the performance of 
polar codes from various polar code constructions. Pigs. 2 
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to 5 present the performance of polar codes produced by 
each of algorithms PCC-0 to PCC-3 at TV = 2048 and 
R = 0.5. Different curves in each figure represent the polar 
codes constructed at different design-SNRs. Clearly, design- 
SNR is critical for all construction algorithms to generate 
polars code with a good performance. Interestingly, PCC-3 
shows high performance variations with design-SNR. 

We may observe that at high design-SNRs, the performance 
degrades with an increase in design-SNR for all the construc¬ 
tion algorithms. In fact in the current example, PCC-2 and 
PCC-3 followed this more precisely and performed the best 
at the least design-SNRs. The least design-SNR we considered 
is —1.5917dB, at which PCC-0 takes the worst-case initial 0.5. 

As discussed in Section IV, the optimal design-SNR is 
a function of all possible parameters such as rate, block- 
length and construction algorithm. We therefore select the best 
design-SNR for each construction algorithm from the above 
graphs and then make a fair comparison. This comparison is 
shown in Fig. 6. We see that if we can find the optimal design- 
SNR, any polar code construction algorithm produces polar 
codes of equally good performance. 

VII. Conclusions 

We have presented a comprehensive survey and full pseu¬ 
docode implementations of all the well-known construction 
algorithms. We then proposed a simple discrete search to find 
the best design-SNR for any given polar code construction 
algorithm. We then compared various polar code constructions 



Fig. 5. PCC-3: The effect of design-SNR at A^ = 2048 and R = 0.5 

and concluded that all are equally good in AWGN if the 
design-SNR is optimized for the best performance. Thus in 
future, we may use simple algorithms only. 
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